RANDOM MATRICES: LOCALIZATION OF THE EIGENVALUES 
AND THE NECESSITY OF FOUR MOMENTS 
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Cm Abstract. Consider the eigenvalues Ai(M„) (in increasing order) of a random 

J^T) Hermitian matrix M„ whose upper-triangular entries are independent with 

1-^ mean zero and variance one, and are exponentially decaying. By Wigner's 

^^ semicircular law, one expects that Xi{M„) concentrates around "fiy/n, where 

^^ Jj'* psc(x)dx = — and psc is the semicircular function. 

^sj In this paper, we show that if the entries have vanishing third moment, 

,_^ then for all 1 < j < n 



^~~? E|Ai(M„) - v^7i|^ = 0(min(n"'=min(j,n + l- J)"2/3„2/3^„l/3+£-)-)^ 

'^ ' for some absolute constant c > and any absolute constant e > 0. In partic- 

^^ ular, for the eigenvalues in the bulk (niin{j, n — i} = 0(n)), 

^ E|A,{M„)-v^7,|2=0(n-'=). 

Cw A similar result is achieved for the rate of convergence. 

S As a corollary, we show that the four moment condition in the Four Moment 

I I Theorem is necessary, in the sense that if one allows the fourth moment to 

change (while keeping the first three moments fixed), then the mean of \i{Mn) 

^^ changes by an amount comparable to n~^ " on the average. We make a precise 

^ conjecture about how the expectation of the eigenvalues vary with the fourth 

^— H moment. 

o 

(N 

•/^ Key words: Random matrices, localization, rate of convergence, four moment the- 

o 

> 

•^ 1. Introduction 

X 

"^ This note is concerned with the local eigenvalue statistics of the following random 

matrix model. 

Definition 1.1 (Wigner matrices). A Wigner matrix is a random hermitian matrix 
Mn = iCij)i<i,3<n such that 

• The Qj for 1 < i < j < n are independent with mean zero and variance 
one, and Q^ = Qj] 
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2 TERENCE TAO AND VAN VU 

• For I < i < j < n, C^ij are identically distributed, with the real and imagi- 
nary parts of Qij being independent and identically distributed with distri- 
bution 77; 

• For 1 < i < n, the Qa are identically distributed with distribution f)\ 

• (Uniform exponential decay) There exist constants C, C" > such that 

(1) P(|C,l>i^)<exp(-t) 
for all t > C and 1 < i,j < n. 

We refer to 77, f) as the atom distributions of M„. 

A classical example of a Wigner matrix is the Gaussian unitary ensemble (GUE), 
in which rj and J) being the normal distribution with mean zero and variances 1/2, 
1 respectively. 

Remark 1.2. Wigner's matrices are not the most general random matrix model for 
which the results here are applicable, but we restrict to this case for simplicity. The 
results, for example, hold for real matrices, in particular Bernoulli matrices. 

A Wigner matrix Af„ has n real eigenvalues 

Ai(M„)<...< A„(M„). 

The global distribution of these eigenvalues has been known since the 1950s, and is 
described by the famous Wigner semicircular law, which asserts that the empirical 
spectral measure 

1 " 

1=1 

converges almost surely (in the vague topology) to the semicircular distribution 
Psc(x) dx, where 

■n 

It is of interest to understand the distribution of individual eigenvalues Ai(Af„). If 
for each 1 < i < 71 we define the classical location 7^ of the normalised z"* eigenvalue 
by the formula 

n^ i 

(2) / psc{x)dx = -. 

J-00 n 

then the Wigner semicircular law (combined with an almost sure bound of (2 -|- 
o(l))i/n for the operator norm of Af„, due to Bai and Yin[|]) is equivalent to the 
assertion that one has 

(3) A,(Af„) = 7,V^ + o(^^) 

uniformly for 1 < i < n, almost surely as n — > 00. If we ignore the o{\/n) error in 
([3]), we are thus led to the heuristic 

(4) A,+i(Af„) - A,(A/„) « min(z, n - ly'^^n-'/"^ 
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for the i eigenvalue spacing. In particular, this spacing should be comparable to 
^-1/2 jj^ tj^g bulk region Sn < i < (1 — S)n (for any fixed S > 0), and as large as 
n~^/^ at the edge of the spectrum. Note though that this derivation is non-rigorous 
as the o{^/n) error could be much larger than the expected gap size Q. 

In the last few years, there has been much progress in formalising the above 
heuristics and obtaining more precise control on the distribution of the eigenvalues 
and their spacings: see e.g. [T7], [S], [H], [5], ^\, [H]. A recent survey of these 
topics can be found in fTO]. 

It is natural to ask whether the o(-y/n) error in (|3| can be improved. In [T], the 
Talagrand concentration inequality was used to establish (among other thingsrj) 
that 

(5) A,(M„) = MA,(Af„) + Oin" min(i, n + 1 - i)) 

with probability at least 1 — OA,e{n~^) (say) for any A,e > 0, where MAi(M„) is 
the median of Ai(M„); see also [11] for closely related results. This was improved 
in [H] to 

(6) A,(M„) = MA,(Af„) + 0{n^ min(i, n + 1- i)^/^) 

with the same probability of 1 — OA.ei'iT' )■ In the bulk region 6n < i < {1 — S)n, 
the concentration of measure arguments in |llj give the bound 

(7) Ai(M„) = v^7, + 0{n^^^+^ min(i, n + 1 - i)-i/3„-i/6 



n 



with probability 1 — 0A.e,5{'n- ) whenever min(z,n + 1 — i) > ?i^"+^ (see Section 
[2] for further discussion of this bound) . 

In all the above estimates, the error term is larger than 1. In the recent paper [SJ 
Theorem 7.1], the bound 

n 

(8) 5]E|A,(M„)-7,y^|2 = 0(ni-^) 

i=l 

was established for some absolute constant c > 0. (Sec also the earlier result in 
[71 Theorem 6.3], which established (Is]) under an additional log-Sobolev hypothesis 
on the distribution.) In the bulk region Sn < i < {1 — 6)n, a significantly stronger 
localisation was obtained in [S] (see the equation preceding (7.8) in that paper), 
namely that Ai(M„) — 7^-^/71 + 0(n~-^/^+'^) with probability Oi.^A,s{n~^) for any 
e, A, with variants of this result also being obtained closer to the edge. This result 
was established via a strong bound on the convergence of the Stieltjes transform, 
which in turn was obtained by a lengthy moment method computation. 



Strictly speaking, the results in [I], [15j only establish the bounds (Isl, kI implicitly, and 
require in addition that the matrix entries are uniformly bounded, rather than exponentially 
decaying. However, the arguments in these papers can be easily extended to the exponentially 
decaying case, after a standard truncation argument to reduce to the case when the entries are 
bounded in magnitude by n^" (say), and replacing the median with a slightly shifted variant. 
We omit the details. 
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Our first main result gives an alternate method to establish eigenvalue localisation, 
based on the three moment theorem, rather than on combining the Stieltjes trans- 
form method with the moment method, and which gives a non-averaged version ([8]) 
(in the case when the third moment vanishes): 

Theorem 1.3 (Localisation). There is an absolute constant c > such that the 
following holds for any constant e > 0. Let M„ be a Wigner matrix whose atom 
distribution 7] has vanishing third moment Erj^ = 0. Then for all 1 < i < n, 

(9) E|A,(Af„) - \/^7,p = 0(min(n-'=min(z,n+ 1 -z)-2/37i2/3,ni/3+'^)). 

One can set c to be 1/1000 (say) and we make no attempt to optimize this constant. 
By summing over i, one obtains ([8]). Furthermore, in the bulk region 6n < i < 
(1 — (5)n, the theorem implies 

E|A,(M„)~V^7,|2 = 05(n-^). 

This is not as strong as the recent localisation result obtained in |8j, but the proof 
is shorter (assuming the three moment theorem) and will suffice for our applica- 
tions. In view of Q, the optimal bound on the right-hand side of ^ should be 
0(min(j, n — «)"^/*ri^^/3+^). 

Let Nj be the number of eigenvalues (of -^M„) in /, and define Fn{x) = ^EiV[_2.a;] • 
The quantity A :— sup^ \Fn{x) — J^^ Pscit)dt\ is of interest and has been investi- 
gated by many researchers (see |3 Chapter 8], [51 and the references therein). In 
these papers, it has been shown that A = 0(n^^'^) under various assumptions (the 
most general one seems to be in [9 which only requires bounded fourth moment). 



The arguments in the proof of Theorem 1.3 can be used to break the n ^ " barrier, 
under the extra third moment condition: 

Theorem 1.4. There is an absolute constant c > such that the following holds. 
Let M„ be a Wigner matrix whose atom distribution rj has vanishing third moment 
E?/3 = 0. T/ien A = 0(^-1/2-=). 



We prove this theorem in Section 2.1 The bound n ^'^ '^ can be improved to 
n"^"'"'^ if we use median instead of expectation in the definition of -F„ (see Remark 



2.6). A related result was proven (using different methods) recently in ^8, Theorem 
6.3], namely that 



n _ ^ 

with probability 1 — 0{n^^ for any fixed A, without a third moment hypothesis. 



Next, we give an application of Theorem 1.3 to demonstrate the sharpness (in 
some sense) of the four moment theorem., introduced by the authors in [181 119] 
in order to study the distribution of eigenvalues of random matrices. We state a 
special case of this theorem here: 

Theorem 1.5 (Four Moment Theorem). For all sufficiently small Cg > the fol- 
lowing holds. Let M„, M^ be two Wigner random matrices whose atom distributions 



NECESSITY OF FOUR MOMENTS 



77,77' have matching moments to fourth order, thus 'Erj^ = £(77')-' for j — 3,4. Let 
G : R ^>- R be a smooth function obeying the derivative bounds 

(10) \G^^^x)\ < n''° 

for all < j < 5 and x GM.. We abbreviate Xi :— Xi{M„) and A^ := Xi(M^). 



Then for n sufficiently large (depending on cq and the constants C, C in M) and 
all I < i < n one has 

(11) |EG(V^A,) - EG(V^A'J| < 77-^». 

// the atom distributions 77, 77' have matching moments only to order 3 rather than 



4 (i.e. E77 = £(77') j, then (11) still holds provided that one strengthens (10) to 



(12) |G(^')(x)| <77-^^'=° 

for all < j < 5 and x d M., and some absolute constant C . 

Proof. See [HI Theorem 15] (which handled the bulk case when 5n < i < [1 — 5)n) 
and [19", Theorem 1.13] (which handled the edge case). These theorems can also 
handle the joint distribution of several eigenvalues at once, as well as somewhat 
more general ensembles than those in Definition |1.1[ but we will not discuss these 
generalisations here. D 



We will refer to the second part of Theorem 1.5 as the three moment theorem. 



Roughly speaking. Theorem 1.5 asserts that the distributions of Ai(M„) and 
Xi{M'^) differ by 0(77^^/^^"^) for some c > if the atom distributions have matching 
moments to order 4, and by Oc{n~'^/'^^'^) for any c > if the atom distributions 
only have matching moments to order 3. For instance, for sufficiently large 77 one 
has 

P(Ai < a) < P(A^ < a + n-^/^-") + n-" 
for some c > if one has matching moments to order 4, and 

P(A, < a) < P(A^ < a + 77-^/2+'=) + 77-^= 

for all c > (with 71 sufficiently large depending on c) if one has matching moments 
to order 3. Morally speaking, this means that the medians MAi,MA^ of Xi, X'^ 
differ by 0(77^^'^^'^) when there are four matching moments and by Oc(77^^'^^'^) 
when there are three matching moments, although this is not quite rigorous due to 
the presence of the ti"'^ error in the above bounds. (For some rigorous bounds on 
the median of A^, see Section [2]) 

The matching moment conditions are essential to the method of proof of Theorem 
|1.5[ which uses a Taylor expansion argument. But it is natural to ask if these 
conditions are in fact necessary. Indeed, if one is not interested in the distribution 
of individual eigenvalues Ai(M„), but instead in the k-point correlation functions 
of these eigenvalues, then in the asymptotic limit 77 — )■ 00 (and with appropriate 
normalisations) , these correlation functions have a universal distribution regardless 
of how many matching moments there are; see |18l [5] (with earlier partial results in 
this direction in [T3], [5]). It is not hard to see that the universality for the hmiting 
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distributions (or joint distributions) of individual eigenvalues imply the universality 
of the fc-point correlation function. 

Our second main result below is evidence that the four moment hypothesis is 
indeed necessary if one wishes to control individual eigenvalues at the scale of the 
eigenvalue spacing Q. 

Theorem 1.6 (Necessity of fourth moment hypothesis). Let Mn,M^ be Wigner 
matrices whose atom variables ri,rj' satisfy Er/^ = £(77')"^ — but their fourth 
moment are different Ery^ 7^ £(77')*. As before, write Xi :— Ai(Af„) and X'^ :— 
Xi{M^). Then for all sufficiently large n, one has 

n 

^|EA, -EA^I >Kni/2 

for some k depending only on the atom distributions. In particular (by the pigeon- 
hole principle) , there exists 1 < i < n such that 

|EAi - EA^I > k' mm{i, n + 1- i)-^/^n-^^^, 

where k' > depends only on the atom distributions. 



Theorem |1.6| is not exactly comparable to the four moment theorem, as it per- 
tains to the mean of the eigenvalues Ai, whereas the four moment theorem instead 
controls quantities such as the median. However, it is expected that the mean and 
median of Ai should be quite close (in particular, closer than the expected eigenvalue 



spacing (HI)), but the best known concentration results for A^ (such as Theorem 1.3) 



are not strong enough to establish this yet. If one assumes that the mean and 



median are sufficiently close, then Theorem 1.6 is strong evidence that the four 



moment theorem breaks down if one only assumes three matching moments. 

The three moment theorem implies (roughly speaking) that the medians of Aj, A^ 
should only differ by 0(n~^/^+'^"), for arbitrarily small cq > 0. In view of this, 
one expects the index i provided by Theorem |1.6| to lie in the bulk region 6n < 



i < (1 — S)n, and indeed the conclusion Theorem 1.6 should in fact hold for most 
i in this bulk region. However, we were unable to demonstrate this. Nevertheless, 
concentration bounds such as those given earlier in this section should be able to 
establish some non-trivial lower bound on min(z, n + I — i). 

The question that how each particular eigenvalue reacts to a change in the fourth 
moment looks very interesting. By utilising higher moments and making some 
heuristic arguments (see the last section of the paper) we are led to the following 
precise conjecture. 

Conjecture 1.7 (Conjectured asymptotic). For Sn < i < {1 — S)n with 5 > 
fixed, one has 

EA, = ni/2^, + n-i/2C,,„ + 7^(7' " 2^^)'Ev^ + Os{n-^^^--) 

for some absolute constant c > 0, where Ci^n is some bounded quantity depending 
only on i,n (and is in particular independent of rj). The same statement should 
also be true for the median MA^ . 
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We ran a numerical test to check the conjecture. We generated two random 
matrices models with whose entry's distributions are Gaussian A^(0, 1) and Laplace 
(0, 1/a/2) and investigated the behaviour of the difference EAi — EA^ as a function 
of i. 




50 100 150 200 250 300 350 400 450 500 



Figure 1: Comparison between 2 curves /i = ^^/''^ et'^-e? '^ ^'^"^ f^ ^ ^f ~ '^^' 



{n = 500) 



This conjecture would imply that if one increases the fourth moment, then (in 
expectation) those A^ with 7^ < — -\/2 or < 7^ < \/2 are shift to the left (de- 
creasing), while those A^ with 7^ > ^2 or > 7^ > —a/2 are shifted to the right 
(increasing). In other words, the eigenvalues in the middle move toward the center 
of the spectrum, while those closer to the edge move outward. 



We prove Theorem 1.6 in Section [3] Apart from Theorem 1.3 the main ingredi- 
ent is a standard moment computation (Lemma 3.1) that compares X^iLi ^t with 
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Notation. We use the usual asymptotic notation as n — > oo, thus 0{f{n)) denotes 
a quantity g{n) bounded in magnitude by Cf{n), and o{f{n)) denotes a quantity 
g(n) bounded in magnitude by c{n)f{n), where c{n) — > as n — >■ oo. If we need, c, C 
to depend on additional parameters, we indicate this by subscripts, e.g. Ok{f{n)) 
is a quantity bounded in magnitude by Ck{n)f{n), where Ck{n) — )■ as n -^ oo for 
each fixed k. 



2. Proofs of Theorems 11J3J and IL4 



We now prove Theorem |1.3| We may assume without loss of generality that e > 
is small; all implied constants can depend on e, and we assume that n is sufficiently 
large depending on e. 

We first observe that the claim is true in the edge cases i ^ l,n. Indeed, in those 
cases the estimate ^ (or ([s])) gives 

A,(Af„)=MA,(Af„) + 0(n^) 

with probability 1 — 0(n^^'^°). On the other hand, the Tracy- Widom law for Wigner 
matrices (see [19]) gives 

MA,(M„)=7»%A^+0(n-i/6) 

at the edge, and the claim follows (with plenty of room to spare). 

We may now reduce to the bulk case min(i,n — i) > n^"+'. Indeed, one can 
deduce the edge case 1 < i < n^/^+^ from the bulk case by setting iq to be the least 
integer greater than ri^"+'^, and using the crude pointwise bound 

|Ai(M„) - v^7,| < \Xi{Mn) - ^^7l| + |Aio(M„) - y^-fi„\ + |^^7l - ^^7,J 

and observing that ^/nji ~ V^Jio = 0{n^^^~^'^^^). Similarly to deal with the case 
n- n(^+'^)/2 < i <n. 

Henceforth we fix i with min(j,n — i) > n^'^+^. The next step is to verify the 
theorem in the model case that M„ is the GUE random matrix ensemble. In this 
case, much sharper concentration results are known. Indeed, we have 

Lemma 2.1 (Concentration for GUE). Let M„ be a GUE matrix, and let I cM. be 
an interval. Let Nj be the counting function Nj :— {1 < i < n : -^Ai(Af„) G /}. 
Then one has 

Pi\Ni - n f psc{x) dx\ > n') < n-^°° 

(say) uniformly in I, if n is sufficiently large depending on e. 

Proof. This follows from the fact that the number of eigenvalues of GUE in / 
can be expressecrl as the sum of independent random variables (see p^ Corollary 
4.2.24]), with variance of logarithmic size (see [121 Lemma 2.3]) and thus strongly 



In fact, in the GUE case Ni has a binomial distribution, though we will not need this fact 



here. 
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concentrated. For details, see [2j. In the bulk region / C [—2 + 6,2 — 6], this type of 
result (for more general Wigner ensembles) was established in [SJ Theorem 6.3]. D 



From this lemma and a standard computation, we conclude that in the GUE case 
one has 

(13) A,(M„) = ^A^7i+0(r^'^min(i,r^ + l-i)-l/3„-l/6) 

with probability 1 — 0(n^^°"). From this bound (and using very crude estimates 
to control the tail event of probability 0(n~^°°), e.g. controlling Ai(Af„) by the 
Frobenius norm of M„), we have 

E|Ai(M„) - Vnl^\^ = 0{n^ min(i, n + 1- i)-i/3„^i/6)2 _^ ^(^-lO) 

(say), which easily implies ([9| (with some room to spare). 

Remark 2.2. In the bulk case 6n < i < {1 — 6)n, a result of Gustavsson \TT shows 
the related statement that Ai(Af„) for GUE is asymptotically normally distributed 
around y/n'^i with variance , ._°^2? ; which is consistent with the above concentra- 
tion results. Using the four moment theorem, the result of Gustavsson was extended 
to other Wigner matrices in [18 . In particular, this gives a bound 

MA,(M„) = Vri7. + o(^^ip^) 

for the median uniformly in the bulk region 6n < i < {l — 6)n for fixed 6 > 0, when- 
ever the atom distribution of M„ has vanishing third moment. As a consequence of 
the recent results in "8], a similar result (with n"^ instead of ^log n) holds without 
the vanishing third moment hypothesis. 



Now we pass from the GUE case to more general Wigner matrices with vanishing 
third moment. The main tool here is the three moment theorem (the second part 



of Theorem 1.5 1. We will also need a weak version of Lemma |2.1| in the non-GUE 
case: 

Lemma 2.3 (Weak concentration for Wigner). Let Af„ be a Wigner matrix, and 
let I d R be an interval. Let Nj be the counting function Nj := {I < i < n : 
-i=Xi(Mn) G /}. Then for any fixed e > 0, one has 



P{\Ni -n psc{x) dx\ > ni/2+^) < 



n-i™ 



(say) uniformly in L, if n is sufficiently large depending on e. 



Proof. This follows from the concentration of measure approach first developed 
in [TT], and then modified in [201 Appendix F] to deal with exponentially decaying 
entries and with the discontinuous nature of the indicator function 1/. As mentioned 
earlier, in the bulk case / C [—2 + 6,2 — 6], stronger results of this type have also 
recently been obtained in [51 Theorem 6.3]. D 
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As a consequence of this lemma and our hypothesis ram{i,n — i) > n^"+^, one 
has the bound (l7| with probabihty 1 — 0(n^^°°). In other words, we can find a 
quantity R comparable to n^' •^^^ min(i, n + 1 — i)^^'^n^^'^ such that 

(14) P(|A,(A/„) - y^7.| >R)< ri-'°". 

We now introduce the function 



G{x):=^i- 



fnR 



where -0 : E -> [0, 1] is a smooth cutoff function supported on [—2, 2] that equals 
-0(2;) :— j^x^ on [—1,1]. Note that ^JnR > ^fn. As such, for sufficiently small 
Co > (independent of e), one easily verifies that 

|V^G(a:)| ■Cn-^^"'' 

for < J < 5 and all a; G M, where C is the constant in the three moment theorem. 
We may thus apply that theorem and conclude that 

EG(^A,(Af„)) = EG(VnA,(M;j) + 0(n-^°) 

for some absolute constant cq > 0, where M^ is drawn from GUE. On the other 



hand, from ( 13 ) one easily sees that 



EG(V^A,(M;)) = -l^E|A.(Af;) - ^^,f + 0(n-i") 



(say), and similarly from (14 1 one has 

EG(V^A,(Af„)) = ^E|A,(A//„) - ^-i,\^ + 0{n-^''). 

We conclude that 

E|A,(Af„) - V^7,p = E|A,(A/;) - ^^,\^ + 0(n-^°i?2) + 0(^-1°). 
Substituting in the definition of R, we obtain Theorem |1.3[ 



Remark 2.4. From (13 1 and the three moment theorem one can also show that for 
any Wigner matrix Af„ whose atom distribution has third vanishing moment, and 
any 1 < i < n, one has 

(15) A^(A/„) = v^7i+0(n^min(i,n + l-i)-^/3„-i/6>) 

with probability 1 — O^in^'^) for some absolute constant c > 0; in particular one 
has 

MAj(A/„) = y/^-f, + Oein" min(i, n + 1- iy^'^n^^'^). 
We omit the details, which are similar to the above calculations and also to the 



proof of [HI Theorem 32] (which is essentially the bulk case of ( 15 )). Heuristically, 
this suggests that one can take c = 1 — e in ([9]) , which would be consistent with the 



results in [S]; however, the available bound 0{n^^) of the tail probability for (15 1 is 
too weak to make this heuristic rigorous. Unfortunately, even if one assumes more 
than three matching moments, the methods of proof in [IB], [IH] do not seem strong 
enough to establish this conjecture; the main technical obstacle arises from the need 
to truncate away the event that an eigenvalue gap such as Ai+i(A/„) — \i[Mn) is 
unexpectedly small, such events occur with a probability of size 0{'nr'^) but no 
better. 
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Remark 2.5. The vanishing third moment was needed since we compare to GUE. 
One can omit this assumption if one can extend Lemma |2.1| to Johansson matrices 
(i.e. Wigner matrices whose atom distribution is gauss divisible, see [I3j). In fact, 
one only needs this lemma with some e < 1/2. A similar remark applies to the 



proof of Theorem 1.4 below. The techniques in the recent paper [8] also implies a 



version of Theorem 1.3 in which no condition on the third moment is required (but 



with a slightly different right-hand side). 



2.1. Proof of Theorem 1.4 We now prove Theorem 1.4 The method of proof 



is only a slight variant of that used above. 

Fix X, and let e > be a small absolute constant to be chosen later. From Lemma 
12.31 one has 

ix- R< N[^2,t] <ix+ R 
with probability 1 - 0{n-^°^), where R = 0{n^/^+^) and 

ix -^n Psc{t) dt. 

J —OQ 

Inside the interval Ix ■— [ix — R,ix — R](^ [1, n], we locate m = 0(n^/^"'"^'^) integers 
ii, . . . ,ijn such that every integer in I^ lies within 0{n^''^~'^) of one of the ij. Then 
with probability 1 — 0{n~^'^'^), one has 

N[-2.x] =ix-R+ V I(^A,^. (Af„) < x) 

i^-R<3<i^+R ^ 

where 1{E) is the indicator of an event E, and we adopt the conventions that 
Xi{Mn) = — oo for i < 1 and Ai(M„) = +00 for i > n. Taking expectations, we 
conclude that 

Fn{x) = -{ix^R) + - V P(^A,(Af„)<x) + 0(n-i°) 

(say). Let M^ be sampled using GUE. Using the three moment theorem as in [T51 
Corollary 21], we have 

P(^A,(Af„) <x)< P(^Aj(A/;) <x + n~^+'') + 0(n-^) 

uniformly in j, for some small absolute c, c' > independent of e. Using this bound, 
we conclude that 

Fn{x) < -iix-R)+- V P(^A,(M;) < x+n-'+-')+0{-n--)+0{n-'") 

The second error term is 0(n^^' ^^'^) if e is small enough depending on c. On the 
other hand, using Lemma |2.3| for M^^ instead of Af„, we conclude from a variant of 
the above arguments that 

F.;^{x + n-^+'') = -ii,-R) + - y P{^\,{AO<x + n-^+') + 0{n~^°) 

n n ^ — ' vn 

i:n-R<j<ini+R 

where F^ is the counterpart of F„ for M'^ instead of Af„ . We conclude that 
F^x) < F'^{x + n-i+^') + 0(n-i/2--); 
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a similar argument also gives 



On the other hand, from Lemma 2.1 one easily sees that 

J —OQ 

for all X, and the claim follows. 

Remark 2.6. Using the three moment theorem, one can control the median 'M.Ni2,x] 



with much higher accuracy than the mean ENi_2^x] ■ Indeed, using ( 15 ) it is not 
difficult to show that 

MNi = n I psc{x) dx + Oein") 

uniformly for all intervals / and any e > 0, assuming vanishing third moment of 
the atom distribution; we leave the details to the interested reader. In view of this, 
it is reasonable to conjecture that one can take c arbitrarily close to 1 in Theorem 



1.4 In ^, this claim is established in the bulk region / C [—2 + 5,2 — 6]. 

3. Proof of Theorem 11.61 



We now begin the proof of Theorem 1.6 Let M„, M^, Ai, A^ be as in that theorem. 



The starting point is the following fourth moment calculation: 

Lemma 3.1 (Fourth moment calculation). Set kq := £(77*) — £((77')^), thus kq y^ 
by hypothesis. Then 



^FiA^l-F/fy^" 

i=l 



E(A*)-E((A^r) = 2^o(n^-n). 



Proof. We expand 



5^E(Af)=E(traceM4) 

= 2_^ ^CabCbcCcdCd 



Of course, there is a similar formula for X]"=i ^K^^ i'^ which the (^ij are replaced 

by a- 

Consider the four sets {a, b}, {b, c}, {c, d}, {d, a}. If one of these sets occurs with 
multiplicity one, then the expectation 'Ed^abCbcCcdCda vanishes from the mean zero 
and independence properties of the coefficients of M„. If instead one has two pairs 
of sets occuring with multiplicity two, then the expectation of 'Ei(^abCbcCcdCda is equal 
to that of 'ECabC'bcC'cdCda- From this we see that 



^E(Af)-^E((A,:r)=2 Y. EdCa^n-EdC 



f |4^ 

b\ 
l<a<b<n 
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But a short calculation (using the fact that rj, rf match to third order) reveals that 

E(|an - E(|Cn = 2(E(r;4) - £((,?')'))• 
The claim follows. D 

The value max{|Ai|, |A„|} is called the spectral norm of Af„ and will be denoted 
by ||M„||. The following result is well-known: 

Lemma 3.2 (Concentration of the spectral norm). For any A > 0, one has 

P(||M„||>3ni/2) = 0^(n-^). 

In particular, 

P(|A.(Af„)|>3ni/2) = 0^(n-4) 

and 

E|A,(M„)|^ = 0^(n^/2). 

Proof. This follows easily from (l5]) or (|6|, combined with ^ (as well as using 
crude estimates, such as Holder's inequality, to deal with the rare tail event in 
which the estimates (Isl) or (|6]) fail). Note that this argument allows us to replace 
the coefficient 3 in the above large deviation inequality by 2 + o(l), but we will not 
need this improvement here. For even sharper concentration results, see the recent 
paper [Sj. D 



We can now invoke Theorem 11.31 to establish 
Proposition 3.3 (Fourth moment concentration). We have 

n 

I J2 E(Af) - (v^7.)' - 4(Vr^7»)'(EA, - Vnj^)] = 0{n^-^) 

4=1 

for some absolute constant c > 0, and similarly for A^. 

Proof. We begin with the Taylor expansion 

Xt - (0^74)^ + 4(V^70'(A, - EA,) + 0(|A, - V^7,|'(|A.| + ^7.)')- 

we see that with probability 1 — 0{n^^'^'^) (say), we have |Ai| + 



3.2 



From Lemma 

^/nji = 0{^/n). Taking expectations and summing using Theorem 1.3 (and using 
crude estimates, such as Holder's inequality, to deal with the tail event of probability 
0(n~^"")) we obtain the claim. D 



Combining Proposition |3.3| with Lemma |3.1| and the triangle inequality, we con- 
clude (for n large enough) that 



\Y,Hm^~'EK)\>\^^o\n' 



i=l 

Since 7^ = 0(1), Theorem |1.6| now follows from the triangle inequality. 



Remark 3.4. One can use [HI Theorem 7.1] as a substitute for Theorem 1.3 in the 
arguments above. 
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4. Higher moment computations 



In this section we discuss a higher moment computation that lead to Conjecture 



4.2 We will restate this conjecture at the end of this section for the reader's 



convenience. 



Lemma 4.1 (Higher moment computations). Af„, M^, A^, X[ be as in Theorem 
Set kq '■— E(77^) — £((77')^), and let k > be an integer. Then we have 



1.6 



5]E(A,^) - EiiX'^)'^) = {2D^k-2)/2^^o + Ok{n-'))n'^ 



hll 
1=1 

where the modified Catalan number Dm is defined to be equal to 

^2m + 2\ _ {2m + 2)1 



^^^^ ^"'^\m-lj (m-l)!(m + 3)! 

when m — 1,2, ... is a positive integer, and D„i = otherwise, thus 

Do = 0; Di^ 1; D2 = 6; D3 = 28; D4 = 120; . . . 
(OEIS sequence A002694^). 

Proof. This is a standard moment method computation (which was the method 
used by Wigner to prove the semi-cirlce law [2|; this is also related to the genus 
expansion from string theory). We have 



i=l 

and similarly for Y^'Li E(A-)* 



l<ai ,...,c 



■•^^0102 ■ • ■ ^a^ai 



Consider the k sets {01,02}, {a2, 03}, ..., {ofe, ai}. If one of these sets appears 
with multiplicity one, then the expectation vanishes. If none of the sets appears 
with multiphcity at least four, then the contributions to X]"=i ^i^i)'' ~Y^^=i ^(^9*^ 
cancel each other out. Thus the only terms that survive are those in which each set 
appears with multiplicity at least two, and at least one set appears with multiplicity 
four. In particular, there are at most {k — 2)/2 distinct sets {0^,0^+1} (with the 
convention afc+i = oi), and thus at most fc/2 distinct values of o^. 

If there are fewer than k/2 distinct values of Oi, then the total contribution here is 
easily seen to be Ok{n^^'^~'^), which is acceptable. Thus we may restrict attention 
to the case when there are exactly fc/2 distinct values of o^, which forces there 
to be exactly (fc — 2)/2 distinct sets {ai,aij^i}, and furthermore the (connected) 
graph formed by these edges cannot contain any cycles and is thus a tree. Finally, 
each set {ai, 0^+1} must appear with multiplicity two, with the exception of one set 
that appears with multiplicity four. The summand E<^aia2 • ■ ■ Qa^ai is then equal to 
2'Et]^, and similarly E^^as ■ • ■ C.ai is then equal to 2E(7y')4. 

We now assign each Oi a label j = j{ai) from 1 to fc/2 by order of appearance; 
thus Oi will be assigned a label j{ai) of 1, the first o^ that is distinct from oi will 
be assigned a label of j(ai) = 2, the first O; has not already been labeled 1 or 2 will 
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be labeled 3, and so forth. The closed path 7 — {{j {ai) , j (02)) , • ■ • , (i(afe/2),i(ai))) 
then traverses a tree T-y of {k — 2)/2 edges spanning the vertices {!,..., fc/2}, where 
the path 7 traverses each edge of T^ with multiplicity two, with the exception of one 
edge of Tj that is traversed four times. Furthermore, the path 7 only encounters a 
vertex j in {1, . . . , k/2} after it has first encountered 1, ... ,j — 1; in particular, the 
starting (and ending) vertex of 7 is necessarily 1. 

Call a closed path 7 = ((ji, J2), • • • , (Ja;/2,.?i)) of length fc/2 in {l,...,fc/2} 4- 
admissible if it traverses a tree of (fc — 2)/2 edges spanning {1, . . . , fc/2}, so that 
each edge is traversed twice with the exception of one edge that is traversed four 
times, and such that each vertex j is encountered only after encountering 1, . . . , j — 1. 
It is not hard to see that each such 4-admissible path contributes 

(n'=/2^0fc(n'=/2-i))x2E7?4 
to X]r=i ■^(^«)'^i s-iid similarly contributes 

to J27=i E(^i)'^- Subtracting, we see that each 7 contributes 

to Er=iE(Aj)'' - Er=iE(A-)*=. Thus, it will suffice to show that for any m, the 
number of 4-admissible paths on trees of m edges is equal to D^- 

The claim is trivial unless ?n is a positive integer. We observe the recurrence 

rn— 1 

(17) D„, = 2 J2 C,Dj+ J2 aCjCkCi 

i,j>0:i-\-j—m—l i,j,fc,/>0:i+j+/c+/— m— 1 

for m = 1, 2, . . ., where 

(2m)! 



(_/ . — 



m\{m -\- 1)! 



^m . 


1- VI- 


-4x 




2x 




,m _ 


(i-\/r^ 


-^xf 



are the Catalan numbers, thus 

Co = 0; Ci = 1; C2 = 2; C3 = 5; C4 - 14; 
Indeed, writing 

c(x) := ^ C„ 

and 

00 

m— 

a brief calculation shows that 

d[x^ = 2xc(x)d{x") + xc(x) 
whence the claim. 

Call a path ((ji, J2), • • • , {J2rmji)) 2-admissible if it traverses a tree of m edges 
spanning {1, . . . , m + 1}, such that each edge is traversed exactly twice, and each 
vertex j is encountered only after encountering 1, . . . , j — 1. It is a classical fact 
that the number of 2-admissible paths is Cm- 
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Now suppose inductively that there are Dj 4-admissible paths on trees of j edges 
for aU j < m. It suffices to show that the number of 4-admissible paths on trees 



of m edges is given by the right-hand side of (17). To do this, consider the first 
edge (ji, J2) of an admissible path 7 on a tree with m edges. This edge is traversed 
either two or four times. Suppose first that it is traversed two times. Then one can 
split 7 into the following pieces: the first edge {ji,J2), a (relabeled) 2-admissible or 
4-admissible path on a tree with i edges that starts and ends at j2, a return edge 
(J2, ji), and a (relabeled) 4-admissible or 2-admissible path on a tree with j edges 
that is disjoint from the first tree that starts and ends at j'l, where i,j>0 add up 
to m — 1. This case gives a net contribution of 

i,j'>0'.i+j—m—l 2,j>0:i+j— rn — 1 



which is the first term of (17) 



Now suppose that (ji, J2) is traversed four times. Then we can split 7 into the 
following pieces: the first edge (ji, J2), a (relabeled) 2-admissible path on a tree with 
i edges that starts and ends at J2, a return edge (J2, ji), a (relabeled) 2-admissible 
path with j edges that starts and ends at ji, a repeated edge (ji, J2), a (relabeled) 
2-admissible path with k edges that starts and ends at J2, a repeated return edge 
(J2, ji), and a (relabeled) 2-admissible path on a tree with I edges that starts and 
ends at ji, where all trees are disjoint (except at their roots) and i,j,k,l > add 



up to TO — 1. This gives the second contribution to the right-hand side of ( 17). D 



We now repeat the arguments from the previous section. A routine generalisation 
of Proposition |3.3| yields the bound 



\Y^i^^) - (^7^)'' - fc(V^7^)'"'(EA, - y^7^)l = Okin'^-') 

for any fc > 1 and some absolute constant c > (note that the left-hand side 
vanishes for fc = 1). We conclude from this and Lemma|4.1|that 



1 " 
(19) - Y kjt^VnCEX, - EA^ = 2D^k-2)/2>io + Ofe(n-=). 



n 



Next, we observe from (18) that 



y^ D(k-2)/2 ^ (z-yj^^4)4 
fc=0 



leVz 



for z near infinity, where we pick the branch of the square root of v z^ — 4 that 
equals z near infinity and is analytic away from the interval [—2,2]. Then the 
right-hand side continues analytically to the exterior of this interval. Calling this 
analytic function /(z), we compute the jump formula 

f{x + ib)- f{x~ib) 
lim ~ nix) 
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where 5 : E — j' E vanishes outside of the interval [—2, 2], and is equal to 

1 x" - 4a;2 + 2 
9[x) 



2tt ^4^ 



r.2 



on this interval. From the Cauchy integral formula we conclude the moment formula 

D(k-2)/2 = / 9{x)x'' dx 

for fc = 0, 1, 2, . . .. The antiderivative of g{x) is 

- — {x^ ~ 2x)y/i-x^ = -^{x^- 2x)psc{x) 
8tt 8 

so by an integration by parts we have 

D{k-2)/2 = / g(a;^ - 2x)kx*' ^ Psc{x)dx. 

By Riemann integration (or more precisely, the trapezoid rule) , the right-hand side 
is equal to 

1 " 1 

for some absolute constant c > 0. 
Thus if we introduce the normalised shift 



we can rewrite ( 19 1 as 



n 

(20) -Vfc7'"\s, -Ofc(n-^). 

n ^—' 

2—1 

This suggests (but does not rigorously proven) that the Sj are small, of size 0(ri~'^), 
at least in the bulk region 5n < i < [1 — 5)n. In particular, we are led to the 
conjecture (Conjecture |4.2[ ) that 

(EA, - EA^) = -^(7? - 27,0'*o + 0(n-i/2-^) 

and this in turn suggests the following asymptotic for the expected value of \i : 

Conjecture 4.2 (Conjectured asymptotic). For Sn < i < {1 — 5)n with 5 > Q 
fixed, one has 

EA, = ni/27, + n-i/2Q,„ + -^(7f - 27,)Er;4 + Os{n-^'^--) 

for some absolute constant c > 0, where Ci^n is some bounded quantity depending 
only on i,n (and is in particular independent of rj). The same statement should 
also be true for the median MAi . 



Specifically, the difficulty is tfiat there could be cancellation between nearby values of Si. If 
one could show some assertion to the effect that Si ~_Sji when i, i' are close together, then this 
would go a long way towards establishing Conjecture 4.2 below. 
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The bound on Ci^„ is plausible in view of results such as Lemma 2.1 It should in 
fact be possible to obtain (at least conjecturally) a precise value for Ci_„ from an 
analysis of the GUE case. Such an asymptotic would demonstrate more precisely the 
dependence of the i*'' eigenvalue on the fourth moment Eirj^ at the scale 8(n~^/^) 
of the mean eigenvalue spacing. 
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